NSF PAR Search | NSF Public Access Repository


Gemstones: A Model Suite for Multi-Faceted Scaling Laws

McLeish, Sean; Kirchenbauer, John; Miller, David Yu; Singh, Siddharth; Bhatele, Abhinav; Goldblum, Micah; Panda, Ashwinee; Goldstein, Tom (February 2025, ArXiv)

Free, publicly accessible full text available February 7, 2026
Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models

Jain, Neel; Shrivastava, Aditya; Zhu, Chenyang; Liu, Daben; Samuel, Alfy; Panda, Ashwinee; Kumar, Anoop; Goldblum, Micah; Goldstein, Tom (December 2024, ArXiv)

Full Text Available
Understanding and Mitigating Copying in Diffusion Models

Somepalli, Gowthami; Singla, Vasu; Goldblum, Micah; Geiping, Jonas; Goldstein, Tom (December 2023, NeurIPS 2023)

This paper proposes methods for detecting and mitigating the blatant replication and memorization of data used to train text-to-image generators, especially Stable Diffusion. The potential for diffusion models to reproduce copyrighted or private images without user knowledge poses significant ethical and legal challenges. For lawmakers, this highlights the need for clear guidelines and regulations around the use of such models, especially in commercial applications.
Full Text Available
A Simple and Efficient Baseline for Data Attribution on Images

Singla, Vasu; Sandoval-Segura, Pedro; Goldblum, Micah; Geiping, Jonas; Goldstein, Tom (December 2023, NeurIPS 2023 Workshop ATTRIB)

Data attribution methods play a crucial role in understanding machine learning models, providing insight into which training data points are most responsible for model outputs during deployment. However, current state-of-the-art approaches require a large ensemble of as many as 300,000 models to accurately attribute model predictions. These approaches therefore come at a high computational cost, are memory intensive, and are hard to scale to large models or datasets. In this work, we focus on a minimalist baseline, utilizing the feature space of a backbone pretrained via self-supervised learning to perform data attribution. Our method is model-agnostic and scales easily to large datasets. We show results on CIFAR-10 and ImageNet, achieving strong performance that rivals or outperforms state-of-the-art approaches at a fraction of the compute or memory cost. Contrary to prior work, our results reinforce the intuition that a model's prediction on one image is most impacted by visually similar training samples. Our approach serves as a simple and efficient baseline for data attribution on images.
Full Text Available
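
The baseline described in the abstract above ranks training images by how close they are to a test image in a pretrained feature space. A minimal Python sketch of that idea follows. It assumes the features have already been extracted by some backbone; the choice of cosine similarity, the function names, and the toy vectors are illustrative assumptions, not details taken from the paper.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def attribute_by_similarity(test_feature, train_features, k=2):
    """Return indices of the k training features most similar to the test feature.

    Hypothetical helper: the similarity metric is an assumption, not the paper's
    confirmed choice.
    """
    scored = [
        (cosine_similarity(test_feature, feat), idx)
        for idx, feat in enumerate(train_features)
    ]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [idx for _, idx in scored[:k]]


# Toy features standing in for embeddings from a pretrained backbone.
train_features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
test_feature = [1.0, 0.05]
top_indices = attribute_by_similarity(test_feature, train_features, k=2)
```

Here the two training vectors pointing in nearly the same direction as the test vector are ranked first, which mirrors the abstract's claim that visually similar training samples matter most.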
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery

Wen, Yuxin; Jain, Neel; Kirchenbauer, John; Goldblum, Micah; Geiping, Jonas; Goldstein, Tom (December 2023, NeurIPS 2023)

The strength of modern generative models lies in their ability to be controlled through text-based prompts. Typical "hard" prompts are made from interpretable words and tokens, and must be hand-crafted by humans. There are also "soft" prompts, which consist of continuous feature vectors. These can be discovered using powerful optimization methods, but they cannot be easily interpreted, re-used across models, or plugged into a text-based interface. We describe an approach to robustly optimize hard text prompts through efficient gradient-based optimization. Our approach automatically generates hard text-based prompts for both text-to-image and text-to-text applications. In the text-to-image setting, the method creates hard prompts for diffusion models, allowing API users to easily generate, discover, and mix and match image concepts without prior knowledge on how to prompt the model. In the text-to-text setting, we show that hard prompts can be automatically discovered that are effective in tuning LMs for classification.
Full Text Available
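
The abstract above describes optimizing discrete prompts with gradients but does not spell out the procedure. One common pattern for this kind of problem, shown here purely as an assumption, keeps a continuous iterate, projects it onto the nearest vocabulary embedding, evaluates the gradient at the projected point, and applies the update to the continuous iterate. The two-dimensional vocabulary, the squared-distance loss, and all names below are hypothetical.

```python
def nearest_token(vec, vocab):
    """Index of the vocabulary embedding closest to vec (squared Euclidean distance)."""
    return min(
        range(len(vocab)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(vec, vocab[i])),
    )


def optimize_hard_prompt(vocab, target, init, lr=0.1, steps=20):
    """Toy projected-gradient search for a single discrete token.

    Loss (assumed for illustration): squared distance between the projected
    token embedding and a target vector.
    """
    soft = list(init)  # continuous iterate
    for _ in range(steps):
        hard = vocab[nearest_token(soft, vocab)]  # projection onto the vocabulary
        # Gradient of the loss, evaluated at the projected (hard) embedding.
        grad = [2.0 * (h - t) for h, t in zip(hard, target)]
        # The update is applied to the continuous iterate, not the hard one.
        soft = [s - lr * g for s, g in zip(soft, grad)]
    return nearest_token(soft, vocab)


vocab = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
best_token = optimize_hard_prompt(vocab, target=[1.0, 1.0], init=[0.0, 0.0])
```

The result is always an index into the vocabulary, so the optimized prompt stays interpretable and can be pasted into a text interface, which is the property the abstract emphasizes for hard prompts.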
A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning

Cherepanova, Valeriia; Levin, Roman; Somepalli, Gowthami; Geiping, Jonas; Bruss, C Bayan; Wilson, Andrew G; Goldstein, Tom; Goldblum, Micah (December 2023, Advances in Neural Information Processing Systems)

Academic tabular benchmarks often contain small sets of curated features. In contrast, data scientists typically collect as many features as possible into their datasets, and even engineer new features from existing ones. To prevent over-fitting in subsequent downstream modeling, practitioners commonly use automated feature selection methods that identify a reduced subset of informative features. Existing benchmarks for tabular feature selection consider classical downstream models, toy synthetic datasets, or do not evaluate feature selectors on the basis of downstream performance. We construct a challenging feature selection benchmark evaluated on downstream neural networks including transformers, using real datasets and multiple methods for generating extraneous features. We also propose an input-gradient-based analogue of LASSO for neural networks that outperforms classical feature selection methods on challenging problems such as selecting from corrupted or second-order features.
Full Text Available
Seeing in Words: Learning to Classify through Language Bottlenecks

Saifullah, Khalid; Wen, Yuxin; Geiping, Jonas; Goldblum, Micah; Goldstein, Tom (June 2023, ICLR 2023)

Neural networks for computer vision extract uninterpretable features despite achieving high accuracy on benchmarks. In contrast, humans can explain their predictions using succinct and intuitive descriptions. To incorporate explainability into neural networks, we train a vision model whose feature representations are text. We show that such a model can effectively classify ImageNet images, and we discuss the challenges we encountered when training it.
Full Text Available
Chroma-VAE: Mitigating Shortcut Learning with Generative Classifiers

Yang, Wanqian; Kirichenko, Polina; Goldblum, Micah; Wilson, Andrew Gordon (September 2022, NeurIPS)

Deep neural networks are susceptible to shortcut learning, using simple features to achieve low training loss without discovering essential semantic structure. Contrary to prior belief, we show that generative models alone are not sufficient to prevent shortcut learning, despite an incentive to recover a more comprehensive representation of the data than discriminative approaches. However, we observe that shortcuts are preferentially encoded with minimal information, a fact that generative models can exploit to mitigate shortcut learning. In particular, we propose Chroma-VAE, a two-pronged approach where a VAE classifier is initially trained to isolate the shortcut in a small latent subspace, allowing a secondary classifier to be trained on the complementary, shortcut-free latent subspace. In addition to demonstrating the efficacy of Chroma-VAE on benchmark and real-world shortcut learning tasks, our work highlights the potential for manipulating the latent space of generative classifiers to isolate or interpret specific correlations.
Full Text Available
PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization

Lotfi, Sanae; Finzi, Marc Anton; Kapoor, Sanyam; Potapczynski, Andres; Goldblum, Micah; Wilson, Andrew Gordon (October 2022, NeurIPS)

While there has been progress in developing non-vacuous generalization bounds for deep neural networks, these bounds tend to be uninformative about why deep learning works. In this paper, we develop a compression approach based on quantizing neural network parameters in a linear subspace, profoundly improving on previous results to provide state-of-the-art generalization bounds on a variety of tasks, including transfer learning. We use these tight bounds to better understand the role of model size, equivariance, and the implicit biases of optimization in generalization for deep learning. Notably, we find large models can be compressed to a much greater extent than previously known, encapsulating Occam’s razor.
Full Text Available
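
The compression approach in the abstract above quantizes parameters that live in a linear subspace of the full weight space. A rough Python sketch follows, assuming weights of the form w = w0 + P z with a fixed projection P and a small set of quantization levels. The matrices, the levels, and the naive fixed-length bit count are toy assumptions; the paper's actual coding scheme and bound computation are not reproduced here.

```python
import math


def quantize(values, levels):
    """Snap each value to the nearest entry in levels."""
    return [min(levels, key=lambda c: abs(v - c)) for v in values]


def reconstruct(w0, P, z):
    """Full weights from subspace coordinates: w = w0 + P z."""
    return [
        w0_i + sum(p_ij * z_j for p_ij, z_j in zip(row, z))
        for w0_i, row in zip(w0, P)
    ]


# Hypothetical toy setup: 3 full weights, 2 subspace coordinates.
w0 = [0.0, 0.0, 0.0]
P = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
z = [0.26, -0.49]
levels = [-0.5, 0.0, 0.5]

z_quantized = quantize(z, levels)
w_compressed = reconstruct(w0, P, z_quantized)

# Naive fixed-length code: each coordinate needs ceil(log2(#levels)) bits.
bits = len(z_quantized) * math.ceil(math.log2(len(levels)))
```

Only the quantized subspace coordinates need to be stored, so the description length depends on the subspace dimension and the number of levels rather than on the full parameter count.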
How much Data is Augmentation Worth?

Geiping, Jonas; Goldblum, Micah; Somepalli, Gowthami; Shwartz-Ziv, Ravid; Goldstein, Tom; Wilson, Andrew Gordon (July 2022, ICML Workshop on Spurious Correlations, Invariance and Stability)

Full Text Available
